Workflow

Click the nodes to obtain details about each step.

pandas_profiling

Statistics

If the workflow has been executed in cluster/cloud, runtimes include the waiting time in the queue.

Configuration

Configuration files
File Code
config.yaml
 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
# config used for Snakemake 


remote: true

gcloud:
    bucket: "lvt-schiphol-assignment-snakemake"
    credentials: "keys/bucket-access.json"


mlflow:
    experiment: "schiphol-snakemake"
    tracking_uri: "lvt-schiphol-assignment-snakemake/mlruns"

raw_files:
    - "flights.csv"
    - "airports.csv"


trelliscope:
    path: "trelliscope"

train_test:
    test_size:
        - 0.2
    strategy:
        - sample
        - timeseries


models:
  - baseline_average
  # - catboost_simple

Loading...